Two members of the Speechmatics speech team visited Hyderabad, India, last week for the 2018 Interspeech Conference. Held at the state-of-the-art Hyderabad International Convention Centre, the conference brought together over 5,000 delegates from around the world.
The team's key objectives were to learn about the latest developments in Automatic Speech Recognition (ASR), to discuss current trends, and to discover the future of speech.
Interspeech is the largest dedicated speech conference, addressing all aspects of speech science and technology, from fundamental theories through to advanced applications including computational modelling and technology development inspired by recent advances in artificial intelligence (AI) and machine learning (ML).
The Speechmatics team attended presentations on numerous research papers, discussed the latest speech developments with other researchers, and showed a technical demo of Speechmatics’ ASR. Two major lessons emerged from the conference. First, there is interest in end-to-end solutions, but it is still unclear whether they surpass other approaches. Second, the future challenges for speech technology include far-field and noisy ASR, as well as under-resourced languages and domains.
The key highlights for the speech team were:
Seeing Speechmatics listed as one of the most accurate speech companies in the world in an Adobe Research paper
Learning about the future trends of speech technology
Witnessing speech technology being applied to more and more real applications and domains
Observing the ASR community getting bigger and better